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Abstract 

In this paper, we consider the minimal entropy of qubit states transmit- 
ted through two uses of a noisy quantum channel, which is modeled by the 
action of a completely positive trace-preserving (or stochastic) map. We 
provide strong support for the conjecture that this minimal entropy is ad- 
ditive, namely that the minimum entropy can be achieved when product 
states are transmitted. Explicitly, we prove that for a tensor product of two 
unital stochastic maps on qubit states, using an entanglement that involves 
only states which emerge with minimal entropy cannot decrease the entropy 
below the minimum achievable using product states. We give a separate 
argument, based on the geometry of the image of the set of density matrices 
under stochastic maps, which suggests that the minimal entropy conjecture 
holds for non- unital as well as for unital maps. We also show that the max- 
imal norm of the output states is multiplicative for most product maps on 
n-qubit states, including all those for which at least one map is unital. 
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For the class of unital channels on , we show that additivity of minimal 
entropy implies that the Holevo capacity of the channel is additive over two 
inputs, achievable with orthogonal states, and equal to the Shannon capacity. 
This implies that superadditivity of the capacity is possible only for non- 
unital channels. 
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1 Introduction 



1.1 Entropy and unital stochastic maps 

When a pure state, represented by a density matrix p, is transmitted along a noisy 
channel, it is mapped into a mixed state $(p). The entropy of the initial pure 
state is necessarily zero, i.e., S{p) = —Trp logp = since = p and so the only 
eigenvalues of p are and 1. However, the entropy ^[^(p)] of the mixed state which 
emerges need not be zero. One seeks states p which minimize the effect of the noise 
in the sense of minimizing the entropy S'[$(p)] of the state that emerges from the 
channel. There are a number of reasons for studying such states, most notably the 
connection between minimizing entropy and maximizing channel capacity, which 



will be discussed in Section p73| . However, in this paper we focus attention on the 
entropy. 

The noise, which results from interactions between the states in a Hilbert space 
Ti. and the environment, is represented by the action of a completely positive, 
trace-preserving map $ on the trace class operators in B{T-C). We use the term 
stochastic to describe such maps. (Following a similar use by Alberti and Uhlmann 

this terminology was used by Petz, [^] and reflects the fact that $ is the non- 
commutative analogue of the action of a column stochastic matrix on a probability 
vector.) We restrict attention to two- level quantum systems in which case Ti = 
or tensor products of C^. A stochastic map $ acting on states in C^, can be 
naturally extended to tensor products, e.g., $®$ acting on states on C^(8>C^ etc., 
and leads to questions about the additivity of the minimal entropy and capacity 
of product channels. 

In particular, Shor has conjectured that the minimal entropy is additive. We 
were led independently to this conjecture because it would imply additivity of 
channel capacity for unital stochastic maps, i.e., maps which take the identity 
operator to itself so that $(/) = /. Although we have a convincing argument that 
our results for entropy, unlike those for channel capacity, extend to non-unital 
maps, we focus most of our attention on unital maps. In the last section we briefly 
consider non-unital maps. 

Recall that every completely positive map $ can be represented (non-uniquely) 
in the Kraus form 

k 

Every map $ representing noisy evolution in a quantum channel must preserve the 
trace of p, since $(p) is also a state. In terms of the Kraus operators, the condition 
that $ be stochastic, that is completely positive and trace preserving, is 

n 

Tr<l>(p)=Trp Vp ^ ^^4 = /. (2) 

fc=i 
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The map $ is unital if $(/) = /, that is if $ maps the identity operator to itself. 
In terms of the Kraus operators, the condition for $ to be completely positive and 
unital is 



$(/)=/ ^ 5^4^, = /. (3) 



fc=i 



A sufficient condition that a stochastic map be unital is that the Kraus operators 
are self-adjoint, i.e.. At = \/k. This condition is not necessary; for example, a 
double-stochastic matrix which is not symmetric corresponds to a unital stochastic 
map which is not self-adjoint. Henceforth in this paper, "unital map" will mean 
"unital stochastic map" unless otherwise stated. 

It is worth noting that the Kraus operators are self-adjoint if and only if $ is self- 
adjoint with respect to the Hilbert-Schmidt inner product (P, Q) = TtP'^Q. The 
dual or adjoint map $ is then defined by the condition Tr[$(P)]^"(5 = TrP'''<I>(Q). It 
is easy to see that if $ is given by (|I]) then <f>(p) = J2k ^kpA^. In addition, since the 
dual of any trace-preserving map satisfies $(/) = /, any stochastic map (considered 
as a linear map on the space of Hilbert-Schmidt operators) has eigenvalue 1, and 
hence a fixed point P such that $(P) = P. 

For unital maps, the identity is an eigenvector whose orthogonal complement 
is the set of operators with trace zero. Hence, a unital map also defines a linear 
map on the traceless part of a density matrix. By contrast, a non-unital map is 
only affine on the set of traceless matrices. This distinction is easily seen for the 

case when the Bloch sphere representation is used as described in section pTT . 
Appendix C contains a list of examples of unital and non-unital maps. 

Recall that the entropy of a density matrix can be written in terms of its 
eigenvalues A^, namely S{p) = — ^^AfclogAfc. The minimal entropy S{p) = 
occurs if and only if one eigenvalue of p is 1 and all others 0; the maximal entropy 
(in d dimensions) of S{p) = logd occurs if and only if all eigenvalues are 1/d so 
that p = Thus, if S{p) ~ 0, one must have one eigenvalue close to 1 and the 
others near 0. Hence, states with small entropy are those for which ||p|| ^ 1. Thus, 
in seeking pure states p which have minimal entropy ^[^(p)] after emerging from a 
noisy channel, we are led to seek states for which ||$(p)|| is maximal. In section ^ 
we give a precise definition of the maximal norm and show that it is multiplicative 
when (at least) one channel is a unital map on C^. 

Our results suggest that multiplicativity of the maximal norm may hold for 
general channels; in fact, we can extend our result to some non-unital channels 
(see Remarks at the end of Section 0). In any case, our results provide strong 
support for the additivity of minimal entropy for unital channels. 

Another property of unital channels is that the entropy of a state is non- 
decreasing under the action of a unital stochastic map. This follows easily from 
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the fact that the relative entropy 

H{P, Q) = TrP[log P - log Q] (4) 

decreases under stochastic maps, i.e., 

H[^P),^Q)]<HiP,Q) (5) 

Since S{p) = -H{p, \l) + \ogd if $ is unital, it follows from (D that 5'[$(p)] > 
S{p). For a non-unital map, the entropy of a pure state cannot decrease; however, 
one can have mixed states for which the entropy actually decreases. 

1.2 Channel Capacity 

We now discuss the information capacity of a noisy quantum channel [jl2|, [|13| 
used for what is sometimes called "classical" communication, i.e., communications 
in which signals are sent using quantum particles but without additional or prior 
entanglement between sender and receiver. In the simplest case where no entangle- 
ment is used in either the transmission or the measurement, each letter i from the 
source alphabet is represented by a pure state which we represent by its density 
matrix pi on a quantum Hilbert space. During transmission the channel transforms 
this state into pi = $(pi), where $ implements the noisy interaction between states 
and the environment. The map $ is a completely positive, trace-preserving map on 
the set of states. The resulting state pi is measured, and the outcome determines 
a letter from the output alphabet. In the general case this measurement is effected 
by a positive operator-valued measurement (POVM) - namely, there is a positive 
operator Ej assigned to each letter j of the output alphabet, which together satisfy 
the constraint Ej = L When the measurement is performed on a state p, the 
result will be j with probability Ti[pEj). 

Several definitions of channel capacity have been proposed, corresponding to 
whether or not entangled states are used for transmission, and whether or not 
entangled measurements are made by the receiver. Bennett and Shor |Q identify 
four possibilities, which we denote Cpp,Cpe,Cep and Cee where the subscripts 
P and E refer to product and entangled processes respectively. 

In the process described above, with no entanglement at either end, the chan- 
nel is equivalent to a classical noisy channel with transition probabilities {pij = 
Tr[piEj)} . Therefore its maximum rate of information transmission is given by 

Cppi^) = Cshan($) = sup V V mp,, log ) , (6) 

where we take the sup over all probability distributions {tTj} for the input alphabet, 
as well as all choices of input states and measurements. We call this the Shannon 
capacity of the channel, since it bears closest resemblance to the classical case. 
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It is reasonable to expect that by transmitting entangled states and by using 
entangled measurements it may be possible to exceed the Shannon capacity for 



a noisy channel. The Holevo-Schumacher- Westmoreland Theorem [|T2|, ||2^ pro- 
vides a closed form expression for the capacity in the case where product states 
are transmitted at the input and entangled measurements of arbitrary length are 
allowed at the output: 

CpE = Cho1v($). (7) 

Here Cho1v('^') is the Holevo capacity of the channel: 

Cho1v($) = sup (S{p) - J2 ^^S{P^)) , (8) 

where p = Yli'^iPi ^^"^ P = ^{p)- The well-known Holevo bound states that 

C^Shan($) < Cho1v($). (9) 

Holevo [T^, Q provided examples of channels in which this inequality is strict, i.e., 
Cshan('^') < Cho1v('^')- Furthermore, it has been shown [0, |l8l that a necessary 



and sufficient condition for strict inequality is that the output states {pi} do not 
commute. 



One important open question is whether or not the Holevo capacity can be 
exceeded when entangled states are used at the input, that is whether Cee exceeds 
CpE- This would be equivalent to the superadditivity of the Holevo capacity. In 
this paper we address this question in the case of messages which are entangled 
over two inputs only. This is equivalent to the question whether Choi('^' ® $) 
exceeds 2Choi('^')- Holevo ^ has shown that Cshan('^' ® > 2Cshan('^') for the 
quantum binary channel, but to the best of the authors' knowledge there is no 
known example of a superadditive channel for the Holevo capacity. Bruss et al 
1^ showed that Choi('^' ® $) = 2Choi('^') for the depolarising channel, which is an 
example of a unital channel. As we will show, our results strongly suggest that if 
the Holevo capacity is superadditive then the channel must be non-unital. 



1.3 Summary of Results 

We prove several theorems about the minimal entropy and the maximal norm for 
states of the form ($ (g) Q){pi2), where $ and fl are unital stochastic maps on C^^^ 
and pi2 is an entangled state. In addition, we explain how these results provide 
evidence for the conjecture that minimal entropy is additive for all stochastic maps 
on C^^^. We also show that this conjecture has important implications for the 
capacity of unital quantum channels. In particular, we show that the conjecture 
implies that if $ is unital, then the Holevo capacity is additive over two inputs, 
that is Cho1v($ ® $) = 2Cho1v($). 
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Our first theorem concerns the maximal value of ||$(p)|| as p varies over states 
on C^. We will consider the general possibility of two stochastic maps $ and Vt on 
C^^^ and denote their maximal values by M$ and Mq respectively. In Theorem 
^ we prove, under mild conditions on one of these maps, that the maximal value 
of II ($ ® VL){pi2)\\ is Mis,Mn, as pu varies over states on C^^^. That is, the norm 
of ($ ® Q){pi2) achieves its maximal value on product states pi2 = pi ® P2, rather 
than on entangled states. 

In Theorem |12|, we prove a similar, though slightly weaker, result for the min- 
imal entropy of ($ (S> fi)(pi2). Namely, we restrict pu to the family of entangled 
states whose reduced density matrices pi and p2 are such that $(pi) and ^^(p2) 
have minimal entropy. Then we prove that the minimal entropy of ($®i^)(pi2), as 
Pi2 varies over this family, is the sum of the minimal entropies of $(pi) and fl{p2)- 
That is, the entropy also achieves its minimal value on product states. 

It seems extremely unlikely that we could further decrease the entropy of ($ ® 
^)(Pi2) by using an entangled state whose reduced density matrices do not have 
minimal entropy themselves. Therefore we believe that the conclusion of Theorem 
12 also holds as pi2 varies over all entangled states on C^^^. In fact, similar 
arguments and numerical evidence support an even stronger conclusion, namely 
that minimal entropy is additive for all stochastic maps on C^^^. This is the 
content of the conjecture below. 

Conjecture 1 //$ and Q are stochastic maps on C^^^, then 



inf S 

p pure 



p pure 



$(g,l])(p) = inf S $(p) + inf S n{p) . (10) 



p pure 



We will discuss the evidence for this conjecture in detail in Section ^ and Section 
572^. We note that P. Shor P3| earlier made a similar conjecture, and together with 



J. Smolin obtained numerical evidence which supports it. 

The unital case is particularly important because it yields the following results 
as immediate corollaries. 

Corollary 2 // $ is unital, then the Holevo capacity is additive, i.e., 

Cho1v($®$)=2Cho1v(*). 



Corollary 3 If ^ is unital, then the Holevo capacity can be achieved with orthog- 
onal states. 



Corollary 4 //$ is unital, thenCuoivi^) = Cshan('^') and CnoiYi^®^) = Cshan('^'® 
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Corollary 5 If ^ is unital, then the Shannon capacity is also additive, i.e., 

Cshan($®$) =2Cshan(<f). 



In Section 2.3 we will explain in detail how these Corollaries follow if Conjecture 



|I| holds for unital maps. 

This paper is organized as follows. In Section |^ we introduce the notation we 
will use for the Stokes parametrization for representing both states and maps in 
a basis consisting of the Identity and Pauli matrices, and show how the various 
conditions of unital, trace-preserving, and complete positivity can be expressed in 
this representation. With this background, we conclude Section ^ by presenting 
the arguments leading to the Corollaries above. 

In Section |^ we prove the multiplicativity of the maximum value of 
Section ^ contains the heart of the paper in which we give the details of the proof 
of our theorem about additivity of minimal entropy. In Section | we discuss some 
of the features of non-unital maps using a special subclass and then present the 
evidence for additivity of minimal entropy in general. Finally, in Section ^ we 
summarize our results and discuss their implications for channel design. 

We also include three Appendices. Appendix A gives some important back- 
ground on singular value decompositions and the details needed for the diagonal 
representation introduced in Section |2.1| . Appendix B gives the details needed to 
verify the complete positivity conditions of Section |2.2| . In Appendix C we provide 
a number of examples of different types of channels and show how some familiar 
examples appear in the representation and notation we use. 



2 Preliminaries 

2.1 Stokes parametrization and Bloch sphere 

Recall that the identity and Pauli matrices form a basis for C^^^ so that any 2x2 
matrix C can be written as WqI + w-a where a denotes the vector of Pauli matrices 
and w G C^. Then for C = WqI + w-cr 

a) C is self-adjoint {wq, w) is real, i.e., € R and w G R^; 

b) TrC = 1 Wo = |; and 

c) C > -^^ |w| < wq. 

Thus, {/, cr} also form a basis for the real vector space of self-adjoint matrices in 
C^^^ and every density matrix can be written in this basis as p = |[/ + w ■ a] with 
w G R^ and |w| < 1. Furthermore 

d) p is a one- dimensional projection (or pure state) -v^ |w| = 1. 
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Every linear map $ : C^^^ q2x2 represented in this basis by a 

unique 4x4 matrix T, and $ is trace-preserving if and only if the first row satisfies 

tik = ^ik, i-e., T = ^ ^ ^ ^ where T is a 3 x 3 matrix (and and t are row and 

column vectors respectively) so that 

^{wqI + w ■ a) = WqI + {t + Tw) ■ a. (11) 

The matrix T is self-adjoint if and only if $ is self-adjoint as an operator on C^^^ 
with respect to the Hilbert-Schmidt inner product. We are interested in those $ 
which map a subspace of self-adjoint matrices into itself, which holds if and only 
if T is real. 

The map $ is unital if and only if t = 0. Thus, any unital stochastic map $ 
acting on density matrices on can be written in the form 

$(![/ + wa]) = i[/ + (Tw)-a], (12) 

where T is a real 3x3 matrix. Using the singular value decomposition (see 
Appendix A), we can write 

T = RS (13) 

where i? is a rotation and S is self-adjoint. Define the map $5 by 

$s(i[/ + wa]) = l[/ + (5w)-o-] (14) 

As explained in Appendix A, the rotation R defines a unitary operator U such 
that for any state p, 

$(p) = t/[$5(p)]f/^ (15) 

In this paper we are interested only in the critical values of certain functions of 
the spectrum of $(p), as p varies over the space of states - the maximum value 
of the norm, the minimum value of the entropy. Since a unitary transformation 
leaves the spectrum unchanged, these are the same for $ and $5. Also, since S 
is self-adjoint it can be diagonalized by a change of basis. Hence without loss of 
generality we need henceforth consider only unital stochastic maps whose matrix 
T defined in (|Tl|) is diagonal, with eigenvalues (Ai,A2,A3). As a shorthand, we 
will denote this diagonal map by $[Ai, A2, A3]. The image of the set of pure state 
density matrices p = |[/ + w-cr] (with |w| = 1) under the action of $[Ai, A2, A3] is 
the ellipsoid 
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and the image under the action of $ is obtained by a further rotation of the 
elhpsoid, corresponding to the operator U in (|T^. 

Similar reasoning apphes when $ is non-unital. Using ( [75D and (|79D from 
Appendix A, the map $ can be written in the form $(p) = U^DiVpV'')W where 
U, V are unitary, D is diagonal and is represented by the matrix 



T 



/I \ 

t'l Ai 

t'2 Ai 

\t'3 As/ 



(17) 



The vector t' = (t'l, t'2, t's) is equal to R2Ri^t in the notation of (ffSf). In this case, 



the image of the set of pure state density matrices p = |[/ + w-cr] (with |w| 
under the action of is the translated ellipsoid 



1) 



Xi — t'l 

Ai 



+ 



X2 - 1'2 
Ao 



+ 



X3 



t'. 



A, 



:i8) 



and again the image under $ is a rotation of this. 



It will be useful to write out explicitly the action of the diagonal unital map 
$[Ai, A2, A3] on a density matrix in the form 



$[Ai, A2, A3](p) = $[Ai,A2,A3 



a b 
fot c 



(19) 



(a + c) + Xsia - c) Ai(6 + fet) + A2(6 - fet) 
Ai(6 + 6t) _ A2(6-6t) (a + c) - A3(a-c) 



2.2 Complete Positivity Conditions 



The requirement that $ be stochastic imposes a number of constraints on the ma- 
trix T. We describe these in Appendix B in which we give explicit formulas for the 
matrix elements of T in terms of the Stokes parameterization of the operators Ak. 
These formulas in turn imply constraints on the eigenvalues (Ai, A2, A3) described 
in the previous section. 

Let Tjk denote the elements of T using the convention that j, /c G . . . 3. Then 
the point with coordinates (Tn, T22, T33) must lie inside a tetrahedron with corners 
at (1, 1, 1), (1, —1, —1), (—1, 1, —1), (—1, —1, 1). These conditions are equivalent to 
four linear inequalities which can be written compactly as 

|Tii±T22| < |1±T33|. (20) 

(Note that we always have Too = 1-) 
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In the special case where $ is unital, (^) implies that the eigenvalues (which 
are necessarily real) satisfy 

|Ai±A2| < IliAsI- (21) 

In fact, for unital $ the condition (|2T|) is a necessary and sufficient condition for 
the numbers (Ai, A2, A3) to arise as eigenvalues of the self-adjoint part of a unital 
stochastic map. 

These conditions were discussed earlier by Algoet and Fujiwara 0. In addition 
they gave conditions for complete positivity of some non-unital maps. In particular, 
for the special case of (|l^ with the form 

$(![/ + w-a]) = I [/ + WiXicri + {t + W3X3)(T3] , (22) 

they showed that the necessary and sufficient condition for complete positivity is 

A? + t'<(l-|A3|)' (23) 

2.3 Relation between capacity and minimal entropy for 
unital maps 

We now assume wlog that $ is self-adjoint and written in the diagonal form 
^['^i) ^^^25 A3]. Let fi = max(|Ai|, IA2I, IA3I), and let be a unit vector satisfy- 
ing Tw^ = ±/iw^. Then it is easy to show that 

inf ^[$(p)] = s[m[I + w,.a])) = h{^) (24) 

where 

/i(/i) = -i(l + p) In i(l + /i) - i(l - /i) In i(l - /i). (25) 

Consider now the question of computing the Holevo capacity Choi('^*) defined 
in (I). The choice pi = |[/ + w^-cr] and p2 = |[/ — w^-cr], and Hi = 112 = |, 
both maximizes the first term S{p) = S{^I) = ln2 and minimizes the second 
term Yl'^i^iPi) ~ h{p). Hence it also maximizes their difference, which gives 
Choi($) = ln2-/i(/i). 

In Section [L^ we stated several corollaries of Conjecture Here we will show 
how these corollaries follow from the assumption that minimal entropy is additive 
for unital maps. 

So suppose that Conjecture |l] holds for unital maps, that is suppose that the 
minimum value of ® ^{pu)) over states pu on C^^^ is 2h{p). Then there are 
four product states, namely pi = |[J± w^-cr] ® |[/± w^-a], such that ®^{pi)) 
achieves this minimum value for each i. If we take tTj = 1/4 for each i, then 
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p = 1/41 ^ I and S'($ ® $(p)) = ln4 achieves its maximum possible value. Hence 
with these choices, we can separately maximize each term on the right side of 
(H) and therefore maximize the Holevo capacity. Therefore, for unital maps the 
equality Cuo\{^®^) = 2Choi('^') is implied by the minimal entropy conjecture. This 
demonstrates Corollary 2. Furthermore, the two minimal entropy states |[/±w^-o"] 
are orthogonal. Hence both Choi(^) and Choi(^®$) are achieved with orthogonal 
states, and this establishes Corollary 3. Also, a simple calculation shows that the 
expression inside the sup in the definition of Cshan('^') in (H) equals Choi('^') when 
we choose the input states to be pj = |[/ ± w^-cx] with vr-j = i, and the POVM to 
be Ei = |[/±w^-(t]. This shows the first statement of Corollary 4, and the second 
statement follows immediately. Then Corollary 5 is a direct consequence. 

The essential observation in this argument is that we can find a partition of 
unity in terms of a set of orthogonal input states which are mapped into a level set of 
minimal entropy. For such inputs, uniform averaging yields the state p = I whose 
output $(p) = / has maximal entropy. Hence both terms in the Holevo capacity 
are simultaneously maximised. On orthogonal inputs have the form |[/±w-ct], 
and the corresponding output states have the same entropy if and only if $ is unital. 
In that case, the products of these states form a set of orthogonal inputs on 
which map onto a level set of entropy under the product map If the minimal 

entropy is additive, one such set of product states will be mapped onto a set of 
minimal entropy. For non-unital maps on C^, and more general (non-product) 
maps on or C^, this need not hold. (Fuchs and Shor [El] have found an explicit 



example of a map on which does not have such a set of orthogonal inputs.) 
Hence the above argument is quite special and does not provide a direct link 
between the additivity of minimal entropy and additivity of the Holevo capacity. 



3 Upper Bound on Norm 

For any linear map Q define 

Mn^ sup ||^^(p)||=supHM (26) 

peDenMat Q>0 -L^V 

so that for any ^ > 0, ||f^(^)|| < MqTtq. Since the matrix norm || ■ || used in (p^) 
is convex, it suffices to consider the supremum over pure states or, equivalently 
one-dimensional projections p = Then Mq can be rewritten using the 

representation (|l|) 

Mo = supJ]|(x,A,^)|' (27) 
where the supremum is taken over those vectors satisfying = \x\ = 1- 
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We restrict attention now to unital maps. As discussed in Section |2.1| , wlog 
we assume that $ is diagonal of the form $[Ai,A2,A3]. Then it follows from the 
discussion in section that 

M.I, = |(l + maxfc|Afc|). (28) 

In this section we will show that for unital maps on C^^^, the norm M$ is 
multiplicative, i.e., M^^q = M^Mq. In fact, we will show a slightly stronger 
result. 

Theorem 6 Let Q be any 2-positive map on C"^" and let ^ be a unital stochastic 
map on C^''^ Then M^c^q = M$Mn. 

Notice that Theorem ^ implies that ||($ ® r2)(p)|| is maximised on product 
states of the form p = pi® p2, where ||$(pi)|| = M$ and ||n(p2)|| = Mq. 



Our proof will need the following well-known result. (See, e.g., [[15 



Lemma 7 Let ^ ~ (^^"i be a matrix in block form with A,C > . Then S 

is (strictly) positive definite (i.e. S > 0) if and only if A > BC^^B'^ if and only if 
C > B^A~^B. 

As immediate corollaries we find 

Corollary 8 Let ^ ~ (^^j be a matrix in block form with A, C positive 

semi-definite. Then S is positive semi-definite if and only if for all u > one of 
the following two equivalent conditions holds 

A + uI > B{C + uI)-^B^ 
C + uI > B\A + uI)-^B. 

Corollary 9 If S = (^^^^ - °' ^^^^ 

WBW^ = \\BB^\\<\\A\\\\C\\. (29) 

To prove this note that 

(v.BB^v) < \\C + uI\\{v,B{C + uI)-^B^v) 

< \\C + uI\\{v,{A + uI)v)) < \\C + uI\\\\A + uI\\\\v\\\ 

Choosing v an eigenvector of BB"^ and letting u proves (p9|). 
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pi 7 
7"^ P2 



be a density matrix 



Returning to the proof of Theorem |g, let p = 

on ® C" written in block form with pi, P2, 7 each n x n matrices. First observe 
that the two-positivity of Q implies that 



> 



(30) 



Hence, it follows from (^) that 

l|fi(7)f = II^^(7M7)^II < I|f^(pi)llll^^(p2)|| 

< TrpiTrpaM^ 



(31) 



Now use the form (p!9[) and the linearity of Q to write 

$ ® (p) = 



P L 
Q 

^[pi +P2 + Xsipl - P2)] (Ai + X2M1) + (Ai - A2)f^(7)^ 
(Ai + A2)l^(7)^ + (Ai - A2)fi(7) ^[pi +P2- Hpi " P2)] 



(32) 



Note that the complete positivity of $ implies that pi + P2 + A3(pi — P2) > 
Thus if a; = Trpi 



\P\\ < MniTr[pi+p2 + A3(pi-p2)] 



Mr, 



and 



(33) 
(34) 



IIQII < Mn[|-A3(x-i) 

Now we can assume wlog that A3 = max^ |Ajt| so that M<j, 
to prove Theorem ^, it suffices to show that 

z > 1(1 + A3)Mn ^ zJ - ® 1]) (p) > 0. 

Note that 

||LLt||<(z-||P||)(z-||g||) 

and the general property P < \\P\\ imply 

L{zI-P)-^L^ < L{z-\\P\\)-^L^ <\\LLm{z-\\P\\)-^ 
< (z-\\Q\\)<zI-Q. 

Therefore, by Lemma 0, to verify (^), it suffices to show (|36|). But it follows from 
ni) that 



i(l + A3). Then 
(35) 

(36) 



WLL^W 



(Ai + + (Ai - A2)^^(7^)J [(Ai + A2)^^(7)^ + (Ai - X2M1) 

< i [(Ai + A2)2 + 2|Ai + A2I |Ai - A2I + (Ai - X^Y] m^)f 

< Xixil-x)Ml 
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where we have used \\n{-f)f = ||^^(7)^f = \\n{-f)n{-f)m 
also imply 

{zl - \\P\\){zI - WQW) > (A3(1 - x)Mo) (xX.Mn) - 
Since we have assumed A3 > A^, these inequalities imply 

Remark: At this point the only use we made of the unital character of $ was 

(a) to give a specific formula for M<p 

(b) to use the special form (|1^) of representing $. 

It is possible to generalize these formulas to some non-unital stochastic maps. 
Any stochastic map $ can be written in the form ([T7|) , where T is the 4x4 matrix 
which represents its action on {I, ai, 0-2,0-3). Suppose Ai is the smallest diagonal 
entry of T. If ti = 0, then the above method can be extended in a straightforward 
way to deduce that $ also satisfies the conclusion of Theorem ^ for any values 
of t2, ^3 allowed by complete positivity. That is, as long as we do not translate 
the ellipsoid in the direction of its shortest major axis, the conclusion still holds. 
This is consistent with the conclusions of Section |^, where we argue that this is 
the hardest case to analyse. The difficulty occurs when the two other major axes 
have equal lengths, so that the ellipsoid is a 'flying saucer'. This produces a circle 
of states of maximal norm and minimal entropy in the ellipsoid. It is necessary to 
show that no entanglement of these minimal entropy states can increase the norm 
above the product bound, or can lower the entropy below the product sum. We 
discuss this situation further in ( p.2| ). 

4 Minimal Entropy Analysis 
4.1 Reduction via Convexity 

Lemma 10 Let Q be unital stochastic maps with M$ and Mq equal to fi and 
V, respectively. Then 

inf ^($®fi)(p)> inf sf$[/i,M,/i] ® fi[i/,t;,i/]Vp) (37) 

p pure p pure \ / 

where \u\ < fi and \v\ < v. 



. However, (^) and (0) 
= x{l-x)\lMl. 



Proof of Lemma: By the results of sections |2.1j and |2]^, we can assume wlog that 
$ and VL are self-adjoint and diagonal, with eigenvalues (Ai, A2, A3) and (cui, a;2, cus) 
respectively, where A3 = /i > and uj^ = u > We first consider the case 
/i, 1/ > 1/3. It follows from (|2T|) that the eigenvalues Ai, A2 he in a convex set with 
extreme points 

(/i,/i), (/i,2/i-l), (2/i-l,/i), (-/i, -/i), (-/X, 1-2/x), (1-2/x, -/x) 
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If we let $1 = /i, fi], $2 = ^[/U, 2/i — 1, /i] etc. so that $j (j = 1 . . . 6) denote 
the stochastic maps corresponding to these six points, we can write $ = Yl^j=i '^j^j 
as a convex combination of these six maps and similarly for Vt = Yl^=i ^k^k- Then, 
since the entropy is concave we find 



\ j=l k=l J 

> 5^5^a,6fc5($,®f]fe)(p) (3^ 

j k 

> min{S(^$j ® fifc)(p) : j = 1...6, A; = 1 . . . 6} 



But now we note that $4 = T3 o and $5 = T3 o $2, where T^i^p) = azpcr^. 
Hence, e.g.. 



($5 ® ^4){p) = [o, ® a,) ($2 ® ni){p) 



so that 



and similarly for 5^$! ® Q^j (p) = 5^$! (g) (p) etc. Hence we can replace (p^) 

by 



5(<l>®l])(p) > min{5(^$j- ®l]fc)(p) : = 1,2,3}. 



(39) 



Since we also have infp pure 'S'|^$[p, u, p] (8> ^ip)j = infp pure 'S'|^$[m, p, p] ® f2(p) 
for any Q, and since $j, j = 1, 2, 3 and fi^. A; = 1, 2, 3 have the form given in the 
lemma, the result follows. 

For p < 1/3 we proceed similarly, but with the convex set given by the rectangle 
with corners (±p, ±p). 

4.2 Special Form of Pure State 

As shown in Section [4.1|, to show additivity of minimal entropy for unital stochastic 



maps it is sufficient to consider self-adjoint maps of the special form $[p, u, p] and 
V, v\. In this section we prove additivity for these maps over a special class of 
entangled states. 
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Theorem 11 Let and Q[i>,v,i>] be diagonal stochastic maps, satisfying 

H > \u\ and v > \v\, so that fj, and u are the largest eigenvalues of $ and Q 
respectively. Let lip) be a pure state of the form lip) = a\0 0) + e^^d\l 1). Then 



S{<^®Q){\ij){ij\) > h{fi) + h{v) 



inf 5[$(p)] + inf5[fi(7)]. 

P 7 



(40) 



We prove Theorem |Tl| in the next section. Here we derive some intermediate 
resuhs which will be used in the proof. For generality we consider diagonal maps 
$[Ai,A2,A3] and VL[uji,uj2,uj^, and for definiteness we also assume that IA3I and 
\uj^\ are their largest singular values. 



We find the density matrix for a pure state of the form 
with a, d real and + d'^ = 1. Then if p = \'ip){ip\ 



a|00) +e*V|ll) 



P 



( ade^^ \ ( 





\ ade~'^ d"^ J \e 

\a [I ® I + a;,® 0;, + I ® az + a. 
-a) + a,0a,- 



a 












1-a ) 



Or. 



(41) 



(42) 



+ [ cos Q (cTa; ® Ox — Oy® Oy) — sin Q (O^ ® (Xy + Oy ® Ore)] 

where a = and t = 4a(l — a), so that t G [0, 1]. Applying the stochastic maps 
gives 

[$ ® Q] (p) = [/ ® / + Xsujsa, ® a, + u^I ® a, + X^a, ® /] (43) 

- a) [/ (g) / + Asco-so-^ ® cr^ - UJ3I ®)(Jz- Aso-^ (g) /] 

+ \Vt [C0S6 (AiCUida; ® (Jx - \2^2(Xy ® 0"y) 

— sin^ {\iijj20'x ® o-y + \2UJ1ay ® a^)] 

Notice that because $ and VL are diagonal, the result remains a linear combination 
of terms of the form 0"^ ® 0"^ (/c = . . . 3) and I ® and cr^ ® J; no cross terms 
of the form ® etc arise. Therefore, the only non-zero terms in [$ ® f2](p) lie 
along the diagonal or skew diagonal. Thus [$ ® f2](p) can be written in the form 

/X x\ 

X X 
X X 

yx ox/ 

where X denotes a non-zero matrix element. Thus, [$ ® ^](p) is equivalent to a 
block diagonal matrix. 
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A straightforward computation shows that these blocks, which we refer to as 
"outer" and "inner" can be written respectively as 



1 + A3a;3 + Vr^(A3 + 0^3) lVi{e'^X+u;+ + e-'^A_u;_) 
|v^(e^^A_a;_ + e-'^X+uj+) 1 + A3a;3 - Vr^(A3 + UJ3) 



(44) 



and 



1 - A3CJ3 + VT^(A3 - Us) i v^(e^^A+cu_ + e''^\.u+) 



1 - A3a;3 - a/I - ^(As - ujs) 



(45) 



where A± = Ai ± A2 and similarly for u!±. 
The first has eigenvalues 



A3a;3 



;i - t)(A3 +a;3)' + it(A^a;^ + Xiu'_ + 2cos(2^)7) 



1/2 



(46) 



while the second has eigenvalues 



1 - A3a;3 



±1 



1 1/2 

;i - t){Xs - co^f + ii(AX + + 2cos(2e)7) ] (47) 



where 7 = A+A_a;+a;_. 

Minimum entropy occurs when both pairs of eigenvalues are spread out as far 
as possible. This happens either at ^ = (if 7 > 0) or at ^ = 7r/2 (if 7 < 0). We 
will be interested in the case Ai > IA2I and Ui > \ou2\, which means that 7 > 0, so 
we assume that = henceforth. 

We need more compact notation for the eigenvalues. Define 



fit) = ( (A3 + 6^3)' - tliXs + - {Xiui + A2u;2)'] 



1/2 



and 



g{t) = ((A3 - UJsf - t[{X3 - Usf - (AlCUl - A2CU2)']) 

Also define 

A = 1 + X3UJ3, B = 1- X3UJ3 
Then the four eigenvalues of (8) Q] (p) at ^ = are simply 

l[A±m], i[B±g{t)] 



1/2 



(48) 



(49) 



(50) 



(51) 



The state l'^) is unentangled when t — and maximally entangled for t = 1. 
We want to show that the entropy is minimized a,t t — 0. We will let S{t) denote 
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the entropy of [$ ® Q] (p) considered as a function of t. To analyze its behavior, it 
is convenient to use the function 



ri{a, x) = —{a + x) \og{a + x) — {a — x) log(a — x). (52) 
It follows that 

S{t) = lv[AJ{t)] + lr^[B, git)] + log A, (53) 

where we have used the fact that ^{A + B) = 1. To find the minimum of S{t) it 
suffices to analyze the behavior of ri[A, f{t)] and ri[B,g{t)]. First observe that 

i,iA/w] = rwiog^ (54) 



and 



^v[A,m] = r(t)iog^^4i-i/'wr 



dt^'' ''''' ' ^A + f{t) A-[f{t)Y 

OA 

- T^lf{f^2lf"it)fit)-\f'(t)\"] (55) 

if f"{t) < and < f{t) < A. This follows from the elementary inequality 
log (fzf ) > Yz|2 (which holds for x E [0, 1]) applied to a; = fit)/A. Now f{t) 
is a function of the form \/a — ht for which one easily checks that f"{t) < 
and \f"{t)\f{t) — |/'(t)P = 0. Therefore, it follows immediately from (|55D that 
^rj[A,f{t)\ < 0. Since g{t) also has the form ^Ja — ht, a similar argument holds 
for ri[B,g{t)\. Hence S"{t) < from which we conclude that S{t) is a concave 
function on [0, 1], and therefore attains its minimum at either t = or t = 1. 

Hence to prove that S attains its minimal value at t = 0, it is necessary and 
sufficient to show that >S'(1) > 5'(0). In essence, we have shown that for a state of 
the form given in Theorem 1 the minimal entropy is attained for either a maximally 
entangled state or a simple product state. 

If we again think of f{t) in the form y/a — bt, then ( pHD (together with our 
assumption that A3 and are the maximal singular values) implies that 6 > 0. 
Combined with (0) this implies that ri[A,f{t)] is increasing. However, this need 
not be true for g(t). For example, when A3 = uj^ and t = 1, g'{l) = \ \\iOJi — \20J2\ 
which implies that g{t) is increasing and ri[B.,g{t)\ decreasing. Thus, the general 
situation is that the entropy S{t) is a linear combination of two concave functions 
corresponding to the contributions from the "outer" and "inner" eigenvalues re- 
spectively. The former is always increasing, while the latter can be decreasing as 
t goes from to 1. To illustrate this. Figure shows how the eigenvalues of the 
product state move in the case where A3 = a;3 = /i. The "inner" eigenvalues are 
both 1(1 — /x^), and they move apart as t increases away from 0, which lowers their 
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contribution to the entropy. The "outer" eigenvalues |(1 ± /i)^ move together as t 
increases, which raises their contribution to the entropy. 
To examine the difference >S'(1) — S'(O), we note that 

^(l) = log4 + i?7(l + Agt^a, lAit^i + A2t^2|) + ^^7(1 - Ascua, lAit^i - A2t^2|) (56) 
5(0) = log4 + ir7(l + A3c^3,A3 + c^3) + ^77(1 - Agc^s, IA3 - c^sl) (57) 

so that 

4[5(l)-5(0)] = r7(l + A3a;3,|Aia;i + A2a;2|)-r/(l + A3a;3,A3 + a;3) (58) 
+77(1 - A3CJ3, \Xiuji - A2u;2|) - vi.^ - Ast^s, IA3 - uj^l). 

Since §^v{'^^ ^) = log < if a; > 0, T]{a, x) is decreasing in x. By our assump- 
tions, A3 > I All > lAiCiJil and > \uj2\ > |A2a;2| so that 

A3 + Co'3 > \\lUJi + \2UJ2\, 

and hence the difference of the the first two terms in (^) (which corresponds to 
the change in entropy from the "outer" eigenvalues) is always positive. The change 
from the inner eigenvalues need not be positive however; indeed, when A3 = 
it must be negative. Thus we need to show that the contribution from the inner 
eigenvalues cannot dominate. 

We gain some intuition from an elementary analysis of r]{^,x). This function 
is largest near a; = 0, where it is fiat, but has its largest derivative near x = ±1. 
Hence one expects the change from the larger " outer" eigenvalues to dominate. 
Explicit analysis of the extreme points in the next section verifies this. 



For the proof of Theorem |Tl| we will restrict to the values Ai = A3 = ;U and 
X2 = u with \u\ < fi, and ui = = u and uj2 = v with |f | < u. In this case (^) 
becomes 

4[S'(1) - 5(0)] = 77(1 + /iz/, /iz/ + Mw) -77(1 + /iz/,/i + z/) (59) 
+77(1 — /iz/, /iz/ — uv) — 77(1 — /iZ/, /i — z/). 



4.3 Analysis of Extreme Points 

In this section we will complete the proof of Theorem |Tl]. By the argument in 
section ^A\ , it suffices to consider either u = ±/i with fi G [0, |], or m = ^ with 
yU G [|, 1], or M = 2/i — 1 with fi G [|, !]• Similarly for Q: either v = ±iy with 
z/ G [0, |], or V = u with u G [|, 1], or t> = 2z/ — 1 with u G [|, 1]. So we wish to 
prove the positivity of S{1) — S{0) for these values of the parameters. 

The simplest case occurs when u = fi, v = u. In this case ( pOf ) becomes simply 

4[5(l)-5(0)] = 7]{1 + i2u,2fiiy) -7]{1 + + (60) 
+r]{l - /iz/, 0) - ■r]{l - ixu, fj. - u). 
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Since 77(0;, x) is decreasing in \x\, the first term dominates the second, and the 
third term dominates the fourth, hence S{1) — S{0) > in this case. 

The remaining cases are handled numerically. It is useful to first consider a 
special case, namely n = u. This arises when the convexity argument is applied 
to the product channel $ CS) $, which is the situation of most interest to us. Then 
(|0D yields 



4[S'(1) - 5(0)] = ~{l + 2ij,^ + uv)\og{l + 2ij,^ + uv) 

-(1 - 2/i2 + uv) log(l - 2fi^ + uv) - 2(1 - uv) log(l - uv) 
+4(1 + /i) log(l + 4(1 - 12) log(l - 12). (61) 



Graphing verifies that (|6lD is positive for the two extreme values uv = ±/i^ in 
the range < /i < | (see Figure ^, and for the three extreme values uv = /x^, 
uv = /i(2/i — 1) and uv = (2/i — 1)^ in the range | < /i < 1 (see Figure |^). The 
graphs show that (|6l|) is smallest when /z ~ 0, 1, so we analyze these regions more 



carefully. In the first case when < yU < | and uv = —fi"^, we expand around 
/i = 0. This gives 



A[S{1) - S{0)] ~ 4/1^ 

In the second case when | < /i < 1 and uv = /i(2/i — 1), write x = 1 — /i, and 
expand in x: 

45(1) -45(0) ~ 3a;log- + x[7(l + log4) -61og3-4(l + log2) 

X i 

~ 3x log - + 3.34a;. (62) 

X 

This is manifestly positive for x small. Similarly in the case uv = {2fi — 1)^ we 
have 

45(1) -45(0) ~ 4xlog- + 4x(l -log2) 

X 

~ 4x log - + 1.23a; (63) 

X 

The general case /i 7^ z/ is handled similarly. We have 

4[5(l)-5(0)] = ~{l + 2fxiy + uv)\og{l + 2fxu + uv) 

— (1 — 2yUi/ + uv) log(l — 2/iz/ + uv) — 2(1 — uv) log(l — uv) 

+2(1 + /i) log(l + /i) + 2(1 - /i) log(l - /i) 

+2(1 + u) log(l + z/) + 2(1 - u) log(l - u). (64) 

By symmetry it suffices to assume that u < jji. For < /i < | and < z/ < 
we have two extreme values uv = -^[xu. Graphing (|6^) shows that it is positive in 
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both of these cases. Again the smallest values occur near /i = z/ = 0, so we expand 
( |5^ around this point. For both values uv = ^fiu this gives 

4[5(l)-5(0)]~2(/i2 + zy2). 

For I < /i < 1 and < z/ < |, there are four extreme values uv = ^u, uv = —fii', 
uv = (2yU — l)z/ and uv = — (2yU — l)z/. The graph of ( [6^ ) is positive in all cases, with 
smallest values around /i = |- For | < < 1 and | < z^ < A^, there are also four 
extreme values, uv = fiu, uv = /i(2i/ — 1), uv = (2/i— and uv = (2/i — l)(2z/ — 1) 
(see Figure ^ for the last of these). The graphs of (|6^ ) are positive in each case, 
and the smallest values occur near fi = u = 1. This region can be analyzed more 
carefully by expanding the functions to leading order in x = 1 — and y = 1 — u. 
For example, when uv = (2yU — l)(2z/ — 1) the expansion of (|6^ yields 

4S'(1) - 45(0) ~ 2[x log X + y logy -2{x + y)log{x + y)] 

+a;(2 + 21og2) + ?/(2 + 21og2) (65) 

Using convexity of the function xlogx, we can bound ( p5D from below hj 2{x + y) , 
which demonstrates positivity for x, y small. Similar results are obtained for the 
other cases. 

4.4 Mixing Discussion 

In this section we extend Theorem |rT| to pure states {ip) formed from any entan- 
glement of states of minimal entropy. After a precise statement of this and proof 
of this extension, we discuss its interpretation and the evidence for more general 
validity of Conjecture |l]. 

Let $ be a unital stochastic map. As explained in Appendix A, we can write 
$ = f/$5f/"l" where f/ is a unitary operator and $5 is self-adjoint. Let fi = \\S\\. 
Define 

£($) = {p = 1(7 + N) I <^s{N) = i/xAT}. (66) 

In words, C{^) is the collection of density matrices which lie in the direction of the 
largest eigenvalue of $5. If this largest eigenvalue is non-degenerate, then £($) is a 
line segment between antipodal points on the Bloch sphere. In case of degeneracy 
it may be a disk, or even the entire Bloch sphere. 

If pi2 is a density matrix on C^®C^, we denote by pi = T2(pi2) and p2 = Ti{pi2) 
the reduced density matrices on obtained by taking the indicated partial traces. 



Theorem 12 Let $ and be unital stochastic maps. Let pi2 be a density matrix 
on C^, such that pi lies in >C($), and p2 lies in C{VL). Then 

S{^®^){pi2) > inf5[$(p)] + inf5[fi(7)]. (67) 

p 7 
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Proof: Wwe assume wlog that $ and Q are diagonal maps in the form $[Ai, A2, A3] 
and Q[u!i,LJ2, u^]. Furthermore we can arrange that A3 = /i is the largest eigenvalue, 
so that |[/ ± 0-3] lies in C{^). Similarly we can arrange that cus = z/ is the largest 
eigenvalue, so that |[/ ± crs] also lies in C{Q). 

Wlog we can assume that pu = IV') ("^ I is a pure state. In the bases which 
diagonalize $ and fl, we have 

IV) = aoolOO) + aoi|01) + aio|10) + an|ll) (68) 

Define the matrix A to be 



aio ctii 

Then as shown in Appendix A, the reduced density matrices are 

Pi = AA\ p2=(^A^Ay (70) 

We obtain the "Schmidt decomposition" by applying the singular value decompo- 
sition to A. The result is a new basis in which has the diagonal form assumed 
in Theorem 0. This decomposition is obtained by finding unitary operators Ui, U2 
which diagonalize AA'^ and A^A respectively, so that UiAU\ is also diagonal. By 
assumption pi lies in >C($), and hence so does AA^ . If p is non-degenerate, then 
/^($) is the line segment consisting of the diagonal density matrices. Therefore 
AA^ is also diagonal, so Ui is equal to the identity, up to a phase. If v is also non- 
degenerate then U2 is also proportional to the identity, and hence pi2 is already in 
diagonal form. The result follows immediately by applying Theorem |Tl|. 

In general either or both of p and v may be degenerate. For example, if p is 
2-fold degenerate, then £($) is a disk containing the 2;-axis in the Bloch sphere. 
So AA^ lies in this disk, and hence it is diagonalized by a rotation of the Bloch 
sphere which preserves this disk. By definition, such a rotation commutes with the 
action of since the plane which contains this disk is an eigenspace of $. Hence 
the unitary operator Ui commutes with $. Similarly if p is 3-fold degenerate, then 
every unitary operator commutes with $. To apply the argument to f2, note that by 
assumption p2 lies in £(f2). If v is non-degenerate, or is 3-fold degenerate, the same 
argument applies. If v is 2-fold degenerate, then C[VL) is a disk. The transpose 
operation on the Bloch sphere is the refiection in the xz-plane, and this does not 
in general preserve a disk containing the z-axis. However we have assumed that 
VL is diagonal, and hence £(f2) either lies in the xz-plane or the yz-plane. In both 
cases the transpose leaves CiVt) invariant, and hence the same argument can be 
applied to deduce that U2 also commutes with f2. It follows that ($ ® r2)(|V')(V|) 
is unitarily equivalent to ($ ® VL){\ip'){ip'\) where |V') has the form assumed in 



Theorem 11, and hence the result follows. 



Remarks: 
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1. The set C{^) contains the states of minimal entropy for $. Theorem |l^ shows 
that by entanghng the minimal entropy states of the individual channels we 
cannot decrease the entropy of the product channel. Since it seems unlikely 
that entangling states of higher entropy will improve the situation, we present 
this as strong evidence for our conjecture. 



2. We illustrate Theorem [l^ in the case where <l'[Ai, A2, A3] is self-adjoint and 
diagonal, with Ai = A3. If the {ip) is real, i.e., the matrix A = ajk is real, then 
the 2x2 unitary matrices Ui and U2 which diagonalize AA^ and A'' A can be 
chosen real and orthogonal, in which case we emphasize this by writing them 
as Oi and Now suppose that our original orthogonal basis |0), |1) on 
corresponds to the eigenvectors of so that the corresponding pure state 
projections are |[/ + w ■ a], with w = (0, 0, 1) corresponding to the "North 
pole" of a sphere. Each unitary 2x2 matrix U can be associated with a real 
orthogonal 3x3 matrix, and the effect of U on the basis vectors corresponds 
to a rotation on the sphere or the action of a real orthogonal 3x3 matrix 
on w. When the original 2x2 matrix is real orthogonal, the corresponding 
rotation on the sphere reduces to a rotation in the xz-plane. 

If we now write the unital stochastic map $ in the form 
$:p=|[/ + w-cT]^ $(p) = l[I + Twa], 

then the change of basis is equivalent to replacing T by OTO^^ where O 
denotes the 3x3 orthogonal matrix associated with O. Thus, for example, 

/ a/r, • /)/o\ / cos 6* sin 6* \ 

when O = f O = 1 . If T ,s 

diagonal with eigenvalues A^, = A^, then OTO~^ = T. Hence every state \ip) 
with real coefficients can be diagonalized, and the matrix T is unchanged. 

3. Suppose that $ has one very large singular value and two small ones. Then 
the unit sphere corresponding to the set of density matrices is mapped into 
an ellipsoid shaped like a football, and the states of minimal entropy lie at 
the ends of the football (see Figure |). We can interpret Theorem 11 as 
saying that entangling these minimal entropy states will not decrease the 
entropy below the sum of the minimum entropies. 

Now suppose that we always keep two eigenvalues equal, but vary the pa- 
rameters so that the ends of the football move in until it becomes a sphere 
and then a "flying saucer" (see Figure ^). The ends of the football have 
moved in to states corresponding to maximal entropy. The minimal entropy 
states now form a great circle. As we explained above, our special form for ip 
allows a general entanglement of states corresponding to these great circles of 
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minimal entropy. Yet even this more general entanglement does not decrease 
the entropy below that of product states. 

4. The discussion above shows that, at least in the case of unital maps, if Con- 
jecture does not hold, then the entanglements which use states of higher 
entropy would achieve a lower entropy on the product space than entangle- 
ments of states of minimal entropy. In addition, such entanglements would 
have to lower the entropy without increasing the largest eigenvalue of $(^12) 
beyond the product value given in Theorem |. We do not find this plausible. 



5 Non-unital Maps 

In this section we give some heuristic evidence to support Conjecture |l|. Before 
doing so, we illustrate the differences between unital and non-unital maps by dis- 
cussing some of the properties of a special class of maps on C^^^. 

A non-unital map is one for which $(/) 7^ /. This means that it takes a ran- 
domly distributed alphabet to a non-random distribution. One would intuitively 
expect that the maximum capacity would then be achieved for alphabets which 
are not evenly distributed. Although this is true classically, it need not be true for 
quantum stochastic maps as shown by the example below. 

It follows from equation (pJ]) that a unital stochastic map defines a linear map 
on the subspace of matrices with trace zero. However, a non-unital map is afiine 
when restricted to this subspace. 



5.1 Special subclass 



We now consider non-unital maps which correspond, in the notation of Section 
to the matrix 



T 



/I 
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As/ 



(71) 



with t ^ 0. This is easily seen to yield the map 

mil + wa]) = i [/ + WiXiai + {t + w^X^)a3] . (72) 
A3 = this is equivalent to the "splaying" channel introduced 



If Ai 



-1= and t = A3 = |, 

by Fuchs |T^ to demonstrate that there exist stochastic maps for which the Holevo 
capacity (P) is achieved only by non-orthogonal states. The case A3 = was con- 
sidered in Ijl^ in a different context. The case Ai = is essentially classical, i.e., if 
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p is restricted to the subset of states of the form ^[Jiwcra], the action of is equiv- 
alent to the action of the column stochastic matrix ^ f! + ! + ! " ! ^ ^ ^ on 



the probability vector ^ 



l + w 
1 — w 

Since equality in p^ ) holds for Fuchs example, it can be regarded as an extreme 
case. Because A2 = for this class of maps, they map the unit sphere of density 
matrices into the ellipse in the x-z plane satisfying the equation 

1 + ^-1 (73) 



A? A 



3 



In the special cases, Ai = and A3 = 0, these elhpses become vertical and hori- 
zontal line segments respectively. 

In the classical case (Ai = 0) it is not hard to show that the maximal capacity 
is never achieved for vr = i. One has only two pure states p± = |[/ ± a-s] (which 
are orthogonal) for which ^[^-[-(p)] is not identical. 

By contrast, for the non-unital quantum case with Ai > A3, it appears, in 
general, that maximal capacity is achieved at vr = | and with non-orthogonal 
states. Moreover, these non-orthogonal states need not correspond to the minimal 
entropy states. Some insight into these observations can be obtained by looking 
at the ellipse (^) in Figure ^ corresponding to Fuchs channel. (Fuchs showed 



explictly that non-orthogonal states are required to achieve maximum capacity, and 
that the maximal capacity achievable with orthogonal states occurs for vr = i.) 
The endpoints of the ellipse (denoted A±) correspond to $(^[/ ± cti]) and have 
entropy h[^{l + 2/3)] while the minimal entropy states (denoted C±) correspond 

to the images of $ / + ( ± ^, 0, i)-cr j and have entropy /i[|(l + 1/v^)]. Note 

that this is the point at which the ellipse meets the circle + = |, which is a 
level set for the entropy on the Bloch sphere. 

The states |[/±(Ti] are the only pair of orthogonal states with identical entropy. 
If one tries to move from toward C+ to lower the entropy from one of a pair 
of orthogonal states, the other orthogonal state must move along the ellipse away 
from A_, down and closer to the origin and hence has a higher entropy than 
A_. Explicit computation shows that the entropy price paid by moving away 
from A_ is greater than that gained by moving A^ toward C+ . For any pair 
of states Pi {i = 1,2), the state p = npi + (1 — 7r)p2 which occurs in (^ lies 
along the line segment between pi and p2. For the points A± it is easy to see that 
the maximum capacity will occur for 11 = ^. If one does not require orthogonal 
states, then simultaneously moving both A± toward C± to decrease the entropy 
seems advantageous. However, this will also decrease the entropy of the convex 
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combination p which increases the capacity. Hence, maximal capacity does not 
occur at the minimal entropy states but at states on the ellipse which lie between 
C± and A±. (As long as the move is symmetric, maximal capacity will occur at 
TT = |. Symmetry suggests this is optimal, but that does not seem to have been 
proved explicitly.) 

One expects similar behavior for any map $ of the form (l7l|) for which Ai > A3. 
When Ai < A3 the major axis of the ellipse lies along the x-axis, there is only one 
state of minimal entropy at the "top" of the ellipse, and one expects the channel 
to behave more and more like a classical channel as the ratio A1/A3 increases. 

When A3 = 0, the ellipse becomes a horizontal line so that the minimal entropy 
states and the endpoints of the ellipse coincide. Hence, the maximal capacity is 
again achieved for orthogonal states. In the limiting case + \\ = 1, the endpoints 
of the ellipse lie on the unit circle of pure states and hence, have entropy zero. 
However, the capacity does not achieve its maximum value of log 2 but instead has 
the value h{t). 

As was noted earlier, $ always has a fixed point. For channels of the form 
(fnl), this fixed point is at (0, 0, j^)- However, we have been unable to attach any 
significance to the fixed point. (For Fuchs channel, the fixed point is at i[C+ + C_.]; 
however, this seems coincidental.) 

At first glance, it might seem that the price paid for the versatility of non-unital 
channels is too great. If we fix A2 and A3, then when t ^ the requirement (^) 
implies that Ai must be smaller (i.e., "noisier") than for the corresponding unital 
channel. For example, using Fuchs values t = A3 = I, A2 = 0, one finds that Ai = 

=^ 0.577 is optimal and corresponds to the least "noisy" direction. However, 
if t = 0, one could increase this to Ai = 2/3 ~ 0.667 with corresponding decrease 
in noise so that the minimal entropy (which comes from the states |[/ ± a^]) is 
h{0.667). For the non-unital channels these same states would yield an entropy 

of only /i(0.577). However, the states ^ J + ( ± ^,0,|)-(j will emerge with 

entropy /i(0.707). The decrease in eigenvalues of the part of $ corresponding to 
the restriction to matrices of trace zero, is overcome by the contribution to the 
emerging state of the non-unital part of the map. We expect that this behavior is 
generic for non-unital maps. 



5.2 Minimal entropy 

Recall the discussion in Section p7T| For unital maps, the generic situation is that 
there are two states of minimal entropy corresponding to the endpoints of the major 
axis. In the generic situation for a non-unital map, unless t is perpendicular to the 
major axis, the ellipsoid in ([T8|) will have only one state of minimal entropy. Hence 
any entanglement will require mixing with a state of higher entropy. In the case of 
a map (such as the Fuchs map (|72|) with IA3I > |Ai|) for which the translation t is 
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perpendicular to the (non-degenerate) major axis of the elhpsoid, there will be two 
states of minimal entropy. However, because the translated ellipsoid is not centered 
at the origin, these two states will not be the images of two orthogonal states, but 
rather the images of two non-orthogonal pure states corresponding to vectors 
and IV'2)- This suggests that the best candidate for a minimal entropy entangled 
state should have the form \E' = Xljfc'^jfclV'j ® V'fc)- However, after changing to 
orthogonal bases and using the SVD decomposition (pSf), this can be rewritten 
in the form = a\xi ® Xs) + d\x2 ® X4) where (xi,X2) = {XsiXi) = and, at 
most, only one state in each of the pairs Xi:X2 and X3;X4 can equal either ipi or 
ip2- Hence any entanglement must include states which are mapped into states of 
higher entropy under the action of $5. Thus one expects states of minimal entropy 
under the action of $5 ® $5 to have the product form \ipj ® tpi^). 

In the case of two-fold degeneracy (of the form \Xj\ = |Afc|), a shift orthogonal to 
both major axes will yield a circle of states of minimal entropy; however, that circle 
will not correspond to a "great circle" but rather a circle of constant latitude. Such 
a circle never includes the image of two orthogonal states and hence the argument 
above still holds. In the case of three-fold degeneracy, the image will be a sphere 
and any shift will again yield only a single state of minimal entropy. 

Since the maps $ and $5 differ only by rotations of the Bloch sphere, the same 
conclusions hold for $. Moreover, our analysis suggests that for any pair of maps 
$ and Q, the states which yield minimum entropy under $ ® will be simple 
product states, regardless of whether one or both are non-unital. 

6 Conclusion 

Our main result in this paper is that channel capacity is additive for unital chan- 
nels, where unital means that the channel maps the totally random input state 
(whose density matrix is proportional to the identity) into itself (for example, ev- 
ery self-adjoint channel is unital). Specifically, we present strong evidence that 
both the Shannon capacity (no entanglement for input states or output measure- 
ments) and the Holevo capacity (inputs unentangled, but output measurements 
may be entangled) are additive over two uses of a two-dimensional unital channel. 
This is the first result that establishes additivity of channel capacity for a broad 
class of quantum channels. 

We show that the problem reduces to finding the states of minimal entropy for 
two uses of the channel. If these minimal entropy states are product states, then this 
implies additivity of capacity. We then prove that one of two possibilities occurs: 
either these minimal entropy states are product states, or else they are entangled 
states whose reduced density matrices have greater than minimal entropy. We 
argue that the latter case is very unlikely (numerical experiments confirm this), 
and so conclude that the former is true. As further supporting evidence, we prove 
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that the maximal norm of states which emerge from the channel is multiplicative 
over two channel uses. 



Our results rely heavily on the Stokes parametrization and the properties of 
the image of the Bloch sphere under stochastic maps. In order to extend them to 
higher dimensions, we would need an effective parameterization of the subspace of 
matrices of trace zero in higher dimensions. One can always write a density matrix 
in C'^^'^ as p = ^[/ + iV] where TrA^ = 0. However, for c? = 4, we do not know 
what the analog of the Bloch sphere looks like. We know only that its boundary 
corresponds to those with eigenvalues +3, —1, —1, —1 which is not the analogue 
of the surface of a sphere. Without knowing the geometry of this region, we can 
hardly hope to answer the important question of how it transforms under maps 
of the form $ (8> Thus, we have been forced to use indirect methods to reach 
conclusions about the states of minimal entropy emerging from $ ® 

Our results have implications for the design of communication channels. Ideally, 
one wants to eliminate all noise. However, this will not be practical and one wants 
to know how best to allocate resources. 

In the case of unital channels, minimal entropy and maximal capacity are 
achieved if signal codes are chosen to correspond to the least noisy "direction" 
or polarization. (Here, we use "direction" in the sense of the Bloch sphere or max- 
imum Afe in our notation. This is unrelated to direction of signal transmission.) 
Hence, if maximizing capacity is the primary goal, then it would seem sufficient 
to minimize noise in only one direction. Even if the orthogonal directions are ex- 
tremely noisy, signals sent using optimal codes will not be affected. However, in 
this case "classical communication" becomes truly classical. If codes are restricted 
to one direction, then one is back in the classical situation with one choice for 
encoding and 1. One has effectively lost the versatility of rotating the code basis 
as a tool for such purposes as signal encryption. 

Non-unital channels have far more versatility, some aspects of which were dis- 
cussed briefly in section |5.1|. Much more work needs to be done analyzing the 



properties of non-unital channels. Thus far, most authors have looked for exam- 
ples of particular maps which illustrate particular facets of stochastic maps (such 
as Fuchs example demonstrating the possibility of maximizing capacity with 
non-orthogonal states). Our approach has been to try to find parameters which 
characterize subclasses of stochastic maps with certain properties. As summarized 
in Appendix C, most of the known examples of noisy channels can easily be shown 
to belong to one of the groups discussed in this paper. A complete analysis of 
non-unital maps would seem to require an extension of conditions of the type (p^) 
to general maps of the form (|T^ with tk, \k 7^ 0. 
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A Singular Value, Polar and Schmidt Decompo- 
sitions 



In this paper we make repeated, and sometimes subtle, use of the singular value 
decomposition (SVD) of matrices. In view of this, and of some confusion in the 
literature about the connection between the SVD and what is often referred to 
as the "Schmidt" decomposition, we provide a brief summary and review of these 
closely related decompositions and their connection to the better known polar 
decomposition. 

We begin with the polar decomposition which can be extended to bounded, 
and even some unbounded, operators [^] on a Hilbert space. 

Theorem 13 (Polar Decomposition) Any m x n matrix A can be written in the 
form A = U\A\ where the n x n matrix \A\ = V A^A is positive semi- definite and 
the m X n matrix U is a partial isometry. 

The term partial isometry means that WU (or, equivalently, f/f/^) is a projection. 
In general, U need not be unique but can be uniquely determined by the condition 
kerf/ = ker A. If A is a square n x n matrix, then U can instead be chosen (non- 
uniquely) to be unitary. Since |y4| is self-adjoint, it can be written as |y4| = VDV^ 
where D is a diagonal matrix with non-negative entries and V is unitary. Inserting 



this in Theorem |I3] with U chosen to be unitary yields the SVD since W = UV is 
also unitary. 

Theorem 14 (Singular Value Decomposition) Any nxn matrix A can he written 
in the form A = WDV'^ with V, W unitary and D a positive semi-definite diagonal 
matrix. 

The non-zero elements of D are called the singular values of A. They are easily seen 
to be the eigenvalues of 1^41 and, hence, their squares yield the non-zero eigenvalues 
of A'^A. As an immediate corollary, one finds that A'^A and AA"^ are unitarily 
equivalent and that V and W are, respectively, the unitary transformations that 
diagonalize A^A and AA'' . These results can be extended to non-square matrices 
if the requirement that V, W be unitary is relaxed to partial isometry. 

Using the notation of Section we can apply the SVD to the 3x3 matrix 
T which corresponds to the restriction of the stochastic map $ to the subspace of 
matrices with trace zero. Because T is real, the matrices V, W can be chosen to 
be real orthogonal so that we can write 

T = CiDO^ (74) 

where Oi, O2 are orthogonal and the superscript T denotes transpose. Now, every 
3x3 orthogonal matrix is either a rotation, or the product of a rotation with the 
inversion — /. Hence we can rewrite (|7i|) as 

T = Ri{±D)Rl = {RiRl)R2{±D)Rl (75) 
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where Ri and R2 are rotations, and we conclude that T can be written as 

T = RS (76) 

where S is self-adjoint and R is a rotation. If $ is unital, define the map $5 by 

^si'WoI + w-a) = wqI + Sw-a (77) 

Since every rotation is implemented by a unitary transformation on C^, there is a 
unitary operator U such that for any state p, 

^p) = U^s{p)U^ (78) 

For non-unital maps, a similar argument can be used to show that any stochas- 
tic map has the form ( |TI| ) where the restriction of $ to the matrices with trace 
zero has the form (|76D and t —>■ R^t, i.e. $(p) = U^sip)^^ where 

^s{wqI + ^-a)=WQl + {R^t + Sw)-a (79) 

and R is the rotation on corresponding to U . 

By construction, either S or —S is positive semi-definite. However, if Tfc(p) = 
o'kpo'k, then composing with the diagonal map $d (wqI + w-cr j = wqI + Dw-a 
merely changes the signs of two of the diagonal elements of D. Since Uak is also a 
unitary map, we can drop the restriction that S be semi-definite by modifying U 
if necessary. This is useful because, as we will see in the next section, the general 
conditions on the eigenvalues of a matrix 5* corresponding to a self-adjoint map 
$ include the possibility of negative and positive eigenvalues. We can summarize 
this discussion in the following 

Theorem 15 Any stochastic map $ on C^^^ can be written in the form $(p) = 
U^s{p)U'' where U is unitary and $5 is a stochastic map whose restriction to 
matrices with trace zero is self- adjoint. 

It may be worth noting that we can apply the polar decomposition theorem 
directly to a completely positive map If we use $ to denote the adjoint with 
respect to the Hilbert-Schmidt inner product, then Theorem |13| implies that we 
can write 

$ = To|$| (80) 

where |$ I = o $ and T is a partial isometry. If $ takes an algebra (e.g., C"^") 
to itself, then T can be chosen to be an isometry, i.e., TT = TT = I. However, 
neither |$| nor T need be stochastic in general (even though their composition 
is). On the contrary, if $5 has an odd number of negative eigenvalues, then 
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$s = r o |$| where T changes the sign of an odd number of eigenvalues and, hence, 
is not a completely positive map. (See Appendix C for further discussion and 
explicit examples.) 

What we have shown in the argument above is that for unital maps on C^^^ , 
the isometry T can always be implemented by a unitary transformation, possibly 
composed with a map F that takes [wqI + w-a] —>■ [wqI — w-a], i.e., there is a 

unitary matrix U such that T has the form T±(^WoI + w-crj = U[woI ± w-(t]W 

where one and only one sign holds. If $ is a non- unital stochastic map, then |$| 
will not even be trace-preserving and the isometry T will correspond to a change 
of basis that mixes the identity / with the three a matrices (in contrast to the 
unital case in which the change of basis affects only the subspace of traceless 
matrices spanned by the three Pauli matrices). Hence for non-unital maps, the 
full polar decomposition (^) of a stochastic map $ may be less useful than the 
polar decomposition on the restriction to matrices of trace zero. For unital maps 
on C"^" with n > 3 it would be interesting to know how much T can differ from 
a map of the form T(p) = U pU'^ where [/ is an n x n unitary matrix. 

In order to see the connection between the SVD and the so-called "Schmidt 
decomposition" , consider a wave function or vector of the form 

^ = ^ajkilJj®Xk (81) 

jk 

with {ipj} and {Xfc} orthonormal. It is not hard to see that there is an isomorphism 
between such vectors and operators of the form 

Kv, = 5^a,fc|^,)(Xfc| (82) 

jk 

and that Kq, is a Hilbert-Schmidt operator if and only if is square-integrable (in 
the case of wave functions). Moreover, if pi2 = |\1/)(\I'|, then 

Pi = T2{pi2) = K^kI 
P2=T,ipu) = {KlK^f 

where pi and p2 are the reduced density matrices obtained by taking the indi- 
cated partial traces T2 and Ti. The "Schmidt decomposition" is an immediate 
consequence of the application of the SVD to the operator given by (|^) which 
implies the following result. 

Theorem 16 (Schmidt) Any wave function of the form can he rewritten as 

^ = ^Pk'^k®Xk (83) 

k 

where pk o'^e the singular values of the matrix A, the bases {ipk} o-nd {xk} o'^e 
orthonormal and related by pki^k = K<i,Xk with K^, given by ((5^. 
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It follows immediately that the reduced density matrices pi and p2 have the same 
non-zero eigenvalues {pk'} and {ipk} and [Xk] are the eigenvectors of pi and p2 
respectively. 

There is an interesting history associated with both the SVD and Schmidt 
decompositions, as well as the attachment of Schmidt's name to (^) in the physics 
literature. In Chapter 3 of |T^, Horn and Johnson give a detailed account of the 
history of the SVD which goes back to Beltrami and Jordan who independently 
obtained the SVD for real n x n matrices in the 1870's. In 1902 Autonne obtained 
the SVD for general nonsingular complex n x n matrices and later made explicit 
the straightforward generalization to singular matrices in a long paper in 1915 
which seems to have been subsequently overlooked by many researchers. 

Independently, Schmidt obtained analogous results in 1907 for operators associ- 
ated with integral kernels. When quantum chemists became interested in density 
matrices in the 1960's, Carlson and Keller rediscovered some of his results. 
However, John Coleman [^j soon pointed out the connection with Schmidt's much 
earlier work. Coleman's observation probably made physicists and chemists aware 
of that work although he did not use the term "Schmidt decomposition" which has 
recently become popular in the quantum computing literature. Physical chemists 
were initially interested in the reduced density matrices which arise from a multi- 
particle wave function of the form \E'(wi . . . Wn) where Wi denotes the space and spin 
coordinates associated with the i-th particle. It was natural to decompose these 
coordinates into two subsets x = Wi . . . Wp and y = Wp+i . . . w„ so that the density 
matrices pi and p2 actually correspond to p-th and (N-p)-th order reduced density 
matrices of \E'. It is noteworthy that if the original wave function \E' has some 
symmetry, then the functions {ipk} and {Xk} can always be chosen so that (^) 
has the same symmetry. In particular, if \E' is antisymmetric, (as required by the 
Pauli exclusion principle for fermions) then these bases can be chosen so that (p^) 
is also antisymmetric; i.e., there is no need to apply an additional antisymmetrizer. 

In the case of multi-particle wave functions \E'(a;, y) is readily interpreted as the 
kernel of an integral operator (which corresponds to defined above) acting on 
an infinite dimensional Hilbert space. Hence, it is quite natural to attribute the 
results to Schmidt when used in this context. However, in quantum computation, 
one only considers the "spin" part of the wave function. Since this can always 
be represented by a finite dimensional matrix, the "diagonalization" of ajk in (pl|) 
can be obtained directly from the SVD; there is no need to detour into infinite 
dimensional Hilbert spaces to use results for integral operators. Moreover, since 
the matrix form of the SVD preceded Schmidt's work, it seems natural to use the 
term SVD decomposition. 
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B Matrix Representation of Stochastic Maps 

To prove the eigenvalue conditions (0), we rewrite the Kraus operators defined 
by (|ID in the form 

Ak = Vkol + Vfe-o- (84) 

where {vko, v^) is a vector in C^. We will let V denote the n x 4 matrix with 
elements Vkj, Vj its columns as vectors in C"" and V = (Vi,V2,V3). Using the 
relation 

{al + u-a){bl + w-a) = (a6 + u ■ w)/ + (aw + 6u + m x w) ■ cr (85) 
one finds 

3 

J=0 
3 

A\Ak = ^ \vkj\'^I + {vko^k + Vko-Vk - i^k X Vfc)-cr 

J=0 

so that 

3 

J2^kAl = (J]|l^,f)/ + 2(3fJ(K,,V) + J]5?vfex53vfc)-a (86) 

k j=0 k 

3 

J24Ak = (^|\/,f)/ + 2(3ft(\/o,V)-5^5RvfcxS5Vfc)-a (87) 

k j=0 k 

where (., .) denotes the standard inner product in and V = (Vi, V2, V3). Hence 
if $ is either unital or trace-preserving, then 

3 

= Tr01/ = 1. (88) 

j=0 

In addition, it follows from (|]) and (H) that 

Tr$(p) = Trp ^{Vq, V) + ^ 3?Vfe x 53vfc = (89) 

k 

$(/) = / ^ 3f?(K,,V)-^3fJvfcx!3vfc = 0. (90) 

k 

This implies that $ is both unital and trace-preserving if and only if, in addition 
to (|Sp, 

^{Vo,Y) = (91) 
^3fJvfcx53v, = (92) 
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where (0) can be rewritten as 

3(V,,\4) = 0, jV^e 1,2,3 (93) 

By defining V to be the matrix obtained by replacing the first column of V by 
Vq = iVo, one can rewrite conditions (|9lD and (|92D as the single requirement that 
that 53 (V^^V^) is diagonal or, equivalently, '^{Vj, Vk) = 0, j 7^ A; G ... 3. 

We will now derive the general form of the real 3x3 matrix T defined by (0) 
so that 

T,fc = iTr(or,<l>(afc)) (94) 

In general T is not symmetric; in fact, it is symmetric if and only if every operator 
Ak is self-adjoint. One finds after straightforward calculation that 

T,, = {Vo,Vo) + {V„V,)-J2{Vi,V^ (95) 
T,, = 2?it{V„V,)T2Q{Vo,Vk) (96) 

where the — holds in (p^) if {i,j,k} is an even or cyclic permutation of {1,2,3} 
and the + sign if it is an odd permutation. Thus, for example. 

Til = {Vo,Vo) + {V^,V^) - {V,,V,) - {Vs,Vs) 
T12 = 2^{Vi,V2)~2'^{Vo,Vs) 



In the special case where T is diagonal its eigenvalues can easily be obtained 
from (p5|). Let (Ai, A2, A3) be the eigenvalues, and define qj = {Vj, Vj) j = . . . 3. 
Then (|95|) becomes 

Ai = % + «?i - 92 - 

A2 = go - gi + 92 - qs (97) 
A3 = go - ?i - ?2 + Qs 

Together with the condition ( pHf ) which can be written as 

1 = go + «?i + «?2 + ^3 (98) 

this implies that the point with coordinates (Ai,A2,A3) must lie inside the tetra- 
hedron with corners at (1, 1, 1), (1, —1, —1), (—1, 1, —1), (—1, —1, 1). Furthermore 
by taking n > 4 and choosing the vectors Vo,Vi,V2,V3 to be orthogonal we see 
that every point in this tetrahedron defines a triplet of eigenvalues which can arise 
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from a unital stochastic operator. These conditions are equivalent to four hnear 
inequahties which must be satisfied by the eigenvalues, namely 



A1 + A2 < I + A3 

Ai — A2 ^ 1 — A3 

-Ai + A2 < 1 - As 

-Ai - A2 < 1 + A3 

which is equivalent to the more compact (pi]). 

These conditions were obtained earlier by Algoet and Fujiwara 



The expression in (U^) and also hold for non-unital $. To calculate the 



non-zero elements in the first column extend ( plf ) to j = and observe that the 
trace-preserving condition (|89| ) implies 9ft(Vo, V^) = — 3?Vfc x Q'Vfc. Using this 
in (13) yields 

tj = T,o = iTra,$(/) = 43ft(Fo, V,) (99) 



C Examples 

We now give some examples of unital and non-unital maps which illustrate some 
of the features discussed earlier and show the correspondence between our param- 
eterizations and some well-known examples which are usually described by their 
Kraus operators. Following the notation of Section p?T| , we will let $[Ai,A2,A3] 
denote a diagonal unital map. 

Before doing so we note that the classic example of a linear, positivity preserv- 
ing map which is not completely positive is the transpose, which corresponds to 
$[1, —1, 1]. We also note that if denotes the maps Tjt(p) = cr^pcrfc, the composi- 
tion TfcO$[Ai, A2, A3] changes the sign of the two eigenvalues whose subscript is not 
k. (For example T2 0<l>[Ai, A2, A3] = $[-Ai, A2, - A3].) Thus the map $[-1,-1,-1] 
which takes |[/ + w-a] ^ |[/ — w-cr] is also not completely positive. 

Examples of Unital Channels 

• Depolarizing channel: $[1-^,1-^,1-^] 

Ao = VT^I, Ak = ^/^ak (fc = 1,2,3) 

• BFS Ig] two-Pauh channel: $[x,a;,2x - 1] 

Ao = ^I, Ak = ^J\{l-x)ak, (fc = l,2) 
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• Phase-damping channel: $[1 — 1 — x, 1] 

Ao = VI - a;/, A± = v^i[/±(T,] 

• Rotation: A single rotation is the simplest example of a unital, non-self- 
adjoint map. Ai = U where U is unitary and det(f/) = +1. 

A convex combination of such rotations using Kraus operators = T]JJk 
where Ir^p = 1 yields a more general example of a unital map which is 
not self-adjoint, namely, = \Tk\^UlpUk- Then the associated 3x3 
matrix T = |rfcpi?fc where Rk is the rotation corresponding to Uk- 

The BPS two-Pauli channel was introduced by Bennett, Puchs, and Smolin 
1^ to demonstrate that entangled states could reduce the probability of error. 
It is worth noting that this map corresponds to the extreme points studied in 
detail in Section ^ Hence, it is particularly noteworthy that our analysis provides 
particularly strong evidence that entanglements do not decrease the entropy for 
these maps. It is also worth noting that when a; = | this map becomes $[|, |, — |]. 
Thus, although $[1,1,-1] and $[— 1,— 1,— 1] are not completely positive, both 
$[|, |, — |] and $[— |, — |, — |] are completely positive. 

Examples of Non-Unital Channels 

• Amplitude-damping channel 




/ 1 \ 







~ 
\t 1 / 

Equality in (Ai ± Aa)^ < (1 ± Xif - t^ 



• Puchs channel and related examples discussed in Section 
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Figure Captions 



Figure 1. The four eigenvalues of the product state of minimal entropy, 
with their direction of movement shown as u increases away from 0. 

Figure 2. The entropy difference 4[5'(1) — >S'(0)] for the two channels 

n, n] $[/^, A*, A*] (upper curve) and /i, /j] ® —/i, /i] (lower dashed 
curve), in the range < // < |. Note that the curves coincide over most the 
interval. 

Figure 3. The entropy difference 4[5'(1) — 5(0)] for the three channels 
ij] (8) fjL, fj] (dots), fjL, fj] 2// — 1, fj] (dashes), and 
2/1 - 1, //] (g) 2/1 - 1, fj] (full) in the range | < // < 1. 

Figure 4. The entropy difference 4[5'(1) — >S'(0)] for the channel 
2// — 1, (8) Q[i/, 2u — 1, i/] in the range | < < 1. 

Figure 5. The Bloch sphere and its image (the ellipsoid) under a unital map 
with one large and two small singular values. The endpoints of the ellipsoid 
are shown separately - these are the unique states of minimal entropy. 

Figure 6. The Bloch sphere and its image under a unital map with two 
large and one small singular values. The "waistband" of the ellipsoid is 
shown separately; this entire circle consists of minimal entropy states. 

Figure 7. The Bloch sphere (the circle of radius 1) and its image (the 
translated ellipse) under the Fuchs map, together with the circle of radius 
1 / y/2. The endpoints of the ellipse are marked A±, and the points of minimal 
entropy are marked C±. 
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1 I 1 1 I 

Figure 1: The four eigenvalues of the product state of minimal entropy, with their 
direction of movement shown as u increases away from 0. 
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Figure 2: The entropy difference 4[S'(1) — S{0)] for the two channels 

^[/i, 11, Ij] ® (upper curve) and /x, /x] ® — /x, /x] (lower dashed 

curve), in the range < < |. Note that the curves coincide over most the 

interval. 
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Figure 3: The entropy difference 4[5'(1) — S{0)] for tfie tfiree cfiannels 
/i, fi] ® /i, fi] (dots), fi, /i] (g) 2yU — 1, /i] (daslies), and 
<l>[/i, 2/i - 1, /i] (g) 2/i - 1, jj] (full) in the range | < yU < 1. 




Figure 4: The entropy difference 4[S'(1) — 5'(0)] for the channel 
2/i — 1, /i ® fi[z/, 2i/ — 1, u] in the range | < /i, < 1. 
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Figure 5: The Bloch sphere and its image (the elhpsoid) under a unital map with 
one large and two small singular values. The endpoints of the ellipsoid are shown 
separately - these are the unique states of minimal entropy. 




Figure 6: The Bloch sphere and its image under a unital map with two large and 
one small singular values. The "waistband" of the ellipsoid is shown separately; 
this entire circle consists of minimal entropy states. 
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Figure 7: The Bloch sphere (the circle of radius 1) and its image (the translated 
ellipse) under the Fuchs map, together with the circle of radius l/\/2. The end- 
points of the ellipse are marked A±, and the points of minimal entropy are marked 
C±. 
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